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Abstract 

A number of recent papers have addressed the algorithmic problem of allocating advertise- 
ment space for keywords in sponsored search auctions so as to maximize revenue, most of which 
assume that pricing is done via a first-price auction. This does not realistically model the Gen- 
eralized Second Price (GSP) auction used in practice, in which bidders pay the next-highest 
bid for keywords that they are allocated. Towards the goal of more realistically modelling these 
auctions, we introduce the Second-Price Ad Auctions problem, in which bidders' payments are 
determined by the GSP mechanism. 

We show that the complexity of the Second-Price Ad Auctions problem is quite different 
than that of the more studied First-Price Ad Auctions problem. First, unlike the first-price 
variant, for which small constant-factor approximations are known, we show that it is NP-hard 
to approximate the Second-Price Ad Auctions problem to any non-trivial factor, even when the 
bids are small compared to the budgets. Second, we show that this discrepancy extends even 
to the 0-1 special case that we call the Second-Price Matching problem (2PM). In particular, 
offline 2PM is APX-hard, and for online 2PM there is no deterministic algorithm achieving a 
non-trivial competitive ratio and no randomized algorithm achieving a competitive ratio better 
than 2. This stands in contrast to the results for the analogous special case in the first-price 
model, the standard bipartite matching problem, which is solvable in polynomial time and which 
has deterministic and randomized online algorithms achieving better competitive ratios. On the 
positive side, we provide a 2-approximation for offline 2PM and a 5.083-competitivc randomized 
algorithm for online 2PM. The latter result makes use of a new generalization that we prove of 
a classic result on the performance of the "Ranking" algorithm for online bipartite matching. 
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1 Introduction 



The rising economic importance of online sponsored search advertising has led to a great deal 
of research focused on developing its theoretical underpinnings. (See, e.g., [18] for a survey). 
Since search engines such as Google, Yahoo! and MSN depend on sponsored search for a significant 
fraction of their revenue, a key problem is how to optimally allocate ads to keywords (user searches) 
so as to maximize search engine revenue PQ El El El El IS] 1201 EU IMl • Most of the research on 
the dynamic version of this problem assumes that once the participants in each keyword auction 
are determined, the pricing is done via a first-price auction; in other words, bidders pay what they 
bid. This does not realistically model the standard mechanism used by search engines, called the 
Generalized Second Price mechanism (GSP) \10 \ I25j. 

In an attempt to model reality more closely, we study the Second-Price Ad Auctions problem, 
which is the analogue of the above allocation problem when bidders' payments are determined by 
the GSP mechanism and there is only one slot for each keyword. The GSP mechanism for a given 
keyword auction reduces to a second-price auction when there is one slot per keyword - given the 
participants in the auction, it allocates the advertisement slot to the highest bidder, charging that 
bidder the bid of the second-highest bidder. 

In the Second-Price Ad Auctions problem, we assume that there is a set of keywords U and a set 
of bidders V, where each bidder v € V has a known daily budget B v and a non-negative bid b u<v for 
every keyword u G U. The keywords are ordered by their arrival time, and as each keyword arrives, 
the algorithm (i.e., the search engine) chooses the bidders to participate in that particular auction 
and runs a second-price auction with respect to those participants. Thus, instead of selecting one 
bidder for each keyword, two bidders need to be selected by the algorithm. Of these two bidders, 
the bidder with the higher bid (where bids are always reduced to the minimum of the actual bid 
and the bidders' remaining budget) is allocated that keyword's advertisement slot at the price of 
the other bid. 

This process results in an allocation and pricing of the advertisement slots associated with each 
of the keywords. The goal is to select the bidders participating in each auction to maximize the 
total profit extracted by the algorithm. For an example instance of this problem, see Figure [TJ 

We note that for simplicity of presentation we have chosen to present the model, the algorithms 
and the results as if the bidders are competing for a single slot. Obviously, our hardness results 
hold for the multi-slot problem as well. 

1.1 Our Results 

We begin by considering the offline version of the Second-Price Ad Auctions problem, in which the 
algorithm knows all of the original bids of the bidders (Section [3]) . Our main result here is that 
it is NP-hard to approximate the optimal solution to this problem to within a factor better than 
m/Rmim where m is the number of keywords and R m in > 1 is a constant independent of m such 
that no bidder bids more than 1/R m i n of its initial budget on any keyword. Thus, even when bids 
are small compared to budgets, it is not possible in the worst case to get a good approximation 
to the optimal revenue. (We show that it is trivial to get a matching approximation algorithm.) 
This result stands in sharp contrast to the standard First-Price Ad Auctions problem, for which 
there is a 4/3- approximation to the offline problem [61 [23] (even for R m in = 1) ; and an e/(e — 1)- 
competitive algorithm to the online problem when bids are small compared to budgets [3 EI] (i-e., 

as Rrnin * Oo). 
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We then turn our attention to a theoretically appealing special case that we call Second-Price 
Matching. In this version of the problem, all bids are either or 1 and all budgets are 1. As before, 
the keywords are ordered by arrival time, and a keyword u can be matched to a bidder v with a 
profit of 1 only if b u ,v = 1, there is a distinct bidder v' with b uv i = 1, and neither v nor v' has been 
matched to a keyword that arrived before u. For an example instance of this problem, see Figure [2j 

Recall that for First-Price Matching or, as we know and love it, maximum bipartite matching, 
the offline problem can of course be solved optimally in polynomial time, whereas for the online 
problem we have the trivial 2-competitive deterministic greedy algorithm and the celebrated e/(e — 
l)-competitive randomized algorithm due to Karp, Vazirani and Vazirani |16j . both of which are 
best possible. 

In contrast, we show that the Second-Price Matching problem is APX-hard (Section |4.1[). We 



also give a 2-approximation algorithm for the offline problem (Section 4.2). We then turn to 
the online version of the problem. Here, we show that no deterministic online algorithm can 
get a competitive ratio better than m, where m is the number of keywords in the instance and 
that no randomized online algorithm can get a competitive ratio better than 2 (Section 5.1). On 
the other hand, we present a randomized online algorithm that achieves a competitive ratio of 
4y / e/(y / e — 1) ~ 5.08 (Section 5.2). To obtain this competitive ratio, we prove a generalization 
of the result due to Karp, Vazirani, and Vazirani [16] and Goel and Mehta [15] that the Ranking 
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Figure 1: An example of the Second-Price Ad Auctions problem: The nodes in U are keywords and the 
nodes in V are bidders. The number immediately to the right of each bidder represents its remaining budget, 
and the number next to each edge connecting a bidder to a keyword represents the bid of that bidder for 



that keyword. Figure 1(a) shows the situation when the first keyword arrives. For this keyword, the search 
engine selects the first bidder, whose bid is 4, and the third bidder, whose bid is 3. The keyword is allocated 



to the first bidder at a price of 3, thereby reducing that bidder's budget by 3. Figure 1(b) shows the situation 
when the second keyword arrives. The first and fourth bidders are selected, and the keyword is allocated to 
the fourth bidder at a price of 2, thereby reducing its remaining budget to 2. As each keyword arrives, the 
bid of a bidder for that keyword is adjusted to the minimum of its original bid and its remaining budget. 



Thus, for example, when the third keyword arrives, as shown in Figure 1(c) the bid of the first bidder for 



that keyword is adjusted from its original value of 6 down to 3 since that is its remaining budget. The two 
bidders then selected are the first and the second, and the keyword is allocated to the second bidder at a 
price of 3. 
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Figure 2: An example of the Second-Price Matching problem: The nodes in U are keywords, the nodes in 
V are bidders, and an edge (u, v) represents the fact that bidder v bids 1 for keyword u. The budget of each 
bidder is 1. In this example, neither of the last two keywords can be matched for profit. In particular, the 
fourth keyword cannot be matched for profit because it has only one neighbor that is not already matched 
(and thus no bidder available to act as a second-price bidder). 

algorithm for online bipartite matching achieves a competitive ratio of e/(e— 1). 
1.2 Related Work 

As discussed above, the related First-Price Ad Auctions problem^ has received a fair amount of 
attention. Mehta et al. [21] presented an algorithm for the online version that achieves an optimal 
competitive ratio of e/(e— 1) for the case when the bids are much smaller than the budgets (i.e., when 
Rmin — > oo), a result also proved by Buchbinder et al. [5L When there is no restriction on the values 
of the bids relative to the budgets (i.e., when R m in can be as low as 1), the best known competitive 
ratio is 2 [19]. For the offline version of the problem, a sequence of papers [El [2] [TT] EJ [Ml E] 
culminating in a paper by Chakrabarty and Goel, and independently, a paper by Srinivasan, show 
that the offline problem can be approximated to within a factor of 4/(4 — 1/Rmin) an d that there 
is no polynomial time approximation algorithm that achieves a ratio better than 16/15 unless 
P = NP%\. 

The most closely related papers to this one are the works of Abrams, Mendelevitch and Tom- 
lin [1], and of Goel, Mahdian, Nazerzadeh and Saberi [14J. The latter looks at the online allocation 
problem when the search engine is committed to charging under the GSP scheme, with multiple 
slots per keyword. They study two models, both of which differ from the model studied in this 
paper, even for one slot. Their first model, called the strict model, allows a bidder's bid to be 
above his remaining budget, as long as the remaining budget is strictly positive. However, as in 
our model, when a bidder is allocated a slot, that bidder is never charged more than his remaining 
budget. Thus, in the strict model, a bidder v with a negligible amount of remaining budget can 

1 This problem has also been called the Adwords problem E] and the Maximum Budgeted Allocation problem 
124) . It is an important special case of SMW 9 11 17 19, 22, 26 , the problem of maximizing utility in a combinatorial 
auction in which the utility functions are submodular, and is also related to the Generalized Assignment Problem 

(gap) mi ulna- 
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Figure 3: In this example, R m in is equal to a constant c, i.e., every bid is at most 1/c of the budget of the 
bidder. In the strict model, all keywords except the first must be allocated to the second bidder at a price 
of k (or the remaining budget if it's smaller). Thus, the total profit on this input for the strict model is at 
most ck + 3. On the other hand, in our model, if we allocate the first keyword to the second bidder, and 
then the next c — 1 keywords to the second bidder, that bidder's budget is reduced to k — 1. Thus, all of the 
remaining keywords can be allocated to the lower bidder at a price of k — 1, for a total revenue exceeding 
ck(k — 1). For k large, this ratio is f2(fc) = fi(m). 

keep his bids high indefinitely, and as long as bidder v is never allocated another slot, this high bid 
can determine the prices other bidders pay on many keywords^] Their second model, called the 
non-strict model, differs from the strict model in that bidders can keep their bids positive even after 
their budget is depleted. Thus, even after a bidder's budget is depleted, that bidder can determine 
the prices that other bidders pay on keywords indefinitely. However, a bidder is never charged more 
than his remaining budget for a slot. Therefore, if a bidder is allocated a slot after his budget is 
fully depleted, it gets the slot for free. 

Under the assumption that bids are small compared to budgets, Goel et al. present a e/(e — 1)- 
competitive algorithm for the non-strict model and a 3-competitive algorithm for the strict model. 
They also show that OPT s t r i c t < OPT non _ s t r i c t < 20PT s t r i c t, where these quantities refer to the 
optimal offline revenue of the search engine in their models. Their algorithms build on the linear 
programming formulation of Abrams et al. [1] for the offline version of the strict model. 

Interestingly, neither of their models or our model completely dominates the others in terms 
of the optimal offline revenue. However, it is fairly easy to show that neither OPT s t r i c t nor 
OPT non _ s trict are ever more than a constant factor larger than the optimal offline revenue from 
our model (see Appendix [A] for a proof of this). On the other hand, the optimal offline revenue 
in our model can be O(m) times as big as OPT non _ s t r i c t and OPT s t r i c t, where m is the number of 
keywords, and this holds even when R m in is a large constant c (see Figure [3]). This is not surpris- 
ing, given the strong hardness of approximation in our model, versus the fact that constant factor 
approximations are available, even online, in theirs. 

Notice also, that for the special case of Second-Price Matching the strict model and our model 

2 In terms of gaming the system, this would be a great way for a bidder to potentially force his competitors to pay 
a lot for slots without backing those payments up with budget. This effect is even worse for the non-strict model. 
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are identical, whereas the non-strict model reduces to standard maximum matching (on all those 
keywords which have at least two bidders bidding on them). 

We feel that our model, in which bidders are not allowed to bid more than their remaining 
budget, is more natural then the strict and non-strict models. It seems inherently unfair that a 
bidder with negligible or no budget should be able to indefinitely set high prices for other bidders. 
We recognize of course that there may be technical issues involving the scale and distributed nature 
of search engines that could make it difficult to implement our model precisely. 



2 Model and Notation 

We define the Second-Price Ad Auctions (2PAA) problem formally as follows. The input is a set 
of ordered keywords U and bidders V. Each bidder v G V has a budget B v and a nonnegative bid 
b uv for every keyword u £ U. We assume that all of bidder u's bids b uv are less than or equal to 
B v . 

Let B v (t) be the remaining budget of bidder v immediately after the i-th keyword is processed 
(so B v (0) = B v for all v), and let b UtV (t) = min(6 U)U , B v (t)). (Both quantities are defined induc- 
tively.) A solution (or second-price matching) to 2PAA chooses for the i-th keyword u a pair of 
bidders v\ and V2 such that b um {t — 1) > b u , V2 {t — 1), allocates the slot for keyword u to bidder 
Vi and charges bidder v\ a price of p[t) = b u>V2 (t — 1), the bid of v%. (We say that v% acts as the 
first-price bidder for u and V2 acts as the second-price bidder for u.) The budget of v\ is then 
reduced by p(t), so B Vl (t) = B Vl {t — 1) — pit). For all other bidders v ^ vi, B v (t) = B v [t — 1). The 
final value of the solution is ^2 t p(t), and the goal is to find a feasible solution of maximum value. 

In the offline version of the problem, all of the bids are known to the algorithm beforehand, 
whereas in the online version of the problem, keyword u and the bids b U)V for each v E V are 
revealed only when keyword u arrives, at which point the algorithm must irrevocably map u to a 
pair of bidders without knowing the bids for the keywords that will arrive later. 

The special case referred to as Second-Price Matching (2PM) is the special case where b UtV is 
either or 1 for all (u, v) pairs and B v = 1 for all v. Perhaps, however, it is more intuitive to think 
of it as a variant on maximum bipartite matching. Viewing it this way, the input is a bipartite 
graph G = (U UV, E), and the vertices in U must be matched, in order, to the vertices in V such 
that the profit of matching u £ U to v £ V is 1 if and only if there is at least one additional vertex 
v' G V that is a neighbor of u and is unmatched at that time. 

Note that in 2PM, a keyword can only be allocated for profit if its degree is at least two. 
Therefore, we assume without loss of generality that for all inputs of 2PM, the degree of every 
keyword is is at least two. 

For an input to 2PAA, let R m in = m i n u,u B v /b UjV , and let m = \U\ be the number of keywords. 



3 Hardness of Approximation of 2PAA 

In this section, we show that it is NP-hard to approximate 2PAA to a factor better than m/R m i n 
for any constant R m in independent of m (Theorem [I]) and provide a trivial algorithm with an 
approximation guarantee that meets this factor. 

For a constant c > 1, let 2PAA(c) be the version of 2PAA in which we are promised that 



Rmin > c. The proof of the following theorem (presented in full in Appendix C.l) uses a gadget 



similar to the instance presented in Figure [3} This gadget exploits the sensitivity of the problem to 
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small changes in budget. In particular, if the budget of a key bidder is reduced by a small amount, 
then the optimal revenue is much higher than if it is not reduced. We prove our inapproximability 
result by setting up the reduction so that the budget of this bidder is reduced if and only if the 
answer to the problem we reduce from is "yes." 

Theorem 1. Let c > 1 be a constant integer. For any constant d > c, it is NP-hard to approximate 
2PAA(c) to a factor ofm/c'. 



Proof. See Appendix C.l □ 



Theorem 2. Let c > 1 be a constant integer. There is an m/c- approximation to 2PAA(c). 

Proof. For each keyword u £ U, let s u be the second-highest bid for u. Consider the algorithm 
that selects the c keywords with the highest values of s u and then allocates these keywords to 
get s u for each of them (i.e., chooses the two highest bidders for u). Since no bidder bids more 
than 1/c of its budget for any keyword, no bids are reduced from their original values during this 
allocation. Hence, the profit of this allocation is at least (c/m) Ylueu Su ~ Since the value of the 
optimal solution cannot be larger than Ylueu Su > it follows that this is an m/c- approximation to 
2PAA(c). " □ 

4 Offline Second-Price Matching 

In this section, we turn our attention to the offline version of the special case of Second-Price 



Matching (2PM). First, in Section 4.1, we show that 2PM is APX-hard. This stands in contrast 



to the maximum matching problem, the corresponding special case of the First-Price Ad Auctions 



problem, which is solvable in polynomial time. Second, in Section 4.2, we give a 2-approximation 
for 2PM. 

4.1 Hardness of Approximation 

To prove our hardness result for 2PM, we use the following result due to Chlebfk and Chelbfkova. 

Theorem 3 (Chlebfk and Chelbfkova |8j). It is NP-hard to approximate Vertex Cover on 4-regular 
graphs to within 53/52. 

The precise statement of our hardness result is the following theorem. 

Theorem 4. It is NP-hard to approximate 2PM to within a factor of 364/363. 



Proof. See Appendix C.2 □ 



4.2 A 2- Approximation Algorithm 

Consider an instance G = {U U V, E) of the 2PM problem. We provide an algorithm that first 
finds a maximum matching / : U — > V and then uses / to return a second-price matching that 
contains at least half of the keywords matched by /J^] Given a matching /, call an edge (u, v) £ E 
such that f(u) ^ v an up-edge if v is matched by / and / _1 («) arrives before u, and a down-edge 

3 Note that / is a partial function. 
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otherwise. Recall that we have assumed without loss of generality that the degree of every keyword 
in U is at least two. Therefore, every keyword u £ U that is matched by / must have at least one 
up-edge or down-edge. Theorem [5] shows that the following algorithm, called ReverseMatch, is a 
2-approximation for 2PM. 



ReverseMatch Algorithm: 



Initialization: 

Find an arbitrary maximum matching f :XJ —* V on G. 
Constructing a 2nd-price matching: 

Consider the matched keywords in reverse order of their arrival. 
For each keyword u: 

If keyword u is adjacent to a down-edge (u,v): 

Assign keyword u to bidder f(u) (with v acting as the second-price bidder). 

Else: 

Choose an arbitrary bidder v that is adjacent to keyword u. 
Remove the edge from /. 

Assign keyword u to bidder f(u) (with v acting as the second-price bidder). 



Theorem 5. The ReverseMatch algorithm is a 2-approximation. 



Proof. See Appendix C.3 □ 



5 Online Second-Price Matching 

In this section, we consider the online 2PM problem, in which the keywords arrive one-by-one and 
must be matched by the algorithm as they arrive. We start, in Section |5.1| by giving a simple 
lower bound showing that no deterministic algorithm can achieve a competitive ratio better than 
m, the number of keywords. Then we move to randomized online algorithms and show that no 
randomized algorithm can achieve a competitive ratio better than 2. In Section [5.2[ we provide a 
randomized online algorithm that achieves a competitive ratio of 4y / e/(y / e — 1) ~ 5.083. 



5.1 Lower Bounds 

The following theorem establishes our lower bound on deterministic algorithms, which matches the 
trivial algorithm of arbitrarily allocating the first keyword to arrive, and refusing to allocate any 
of the remaining keywords. 

Theorem 6. For any m, there is an adversary that creates a graph with m keywords that forces 
any deterministic algorithm to get a competitive ratio no better than 1/m. 

Proof. The adversary shows the algorithm a single keyword (keyword 1) that has two adjacent 
bidders, a\ and &2- If the algorithm does not match keyword 1 at all, a new keyword 2 arrives 
that is adjacent to two new bidders a<i and 62. The adversary continues in this way until either 
m keywords arrive or the algorithm matches a keyword k < m. In the first case, the algorithm's 
performance is at most 1 (because it might match keyword m), whereas the adversary can match 
all m keywords. Hence, the ratio is at most 1/m. 
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In the second case, the adversary continues as follows. Suppose without loss of generality that 
the algorithm matches keyword k to ctfc. Then each keyword i, for k + 1 < i < m, has one edge 
to at and one edge to a new bidder c&. Since the algorithm cannot match any of these keywords 
for a profit, its performance is 1. The adversary can clearly match each keyword i for profit, for 
1 < i < k— 1, and if it matches keyword k to bk, then it can use as a second-price bidder for the 
remaining keywords to match them all to the Cj's for profit. Hence, the adversary can construct a 
second-price matching of size at least m. □ 

The following theorem establishes our lower bound of 2 for any (randomized) online algorithm 
for 2PM. 

Theorem 7. The competitive ratio of any randomized algorithm for 2PM must be at least 2. 



Proof. See Appendix C.4 □ 



5.2 A Randomized Competitive Algorithm 

In this section, we provide an algorithm that achieves a competitive ratio of 2y / e/(y / e — 1) ~ 5.083. 
The result builds on a new generalization of the result that the Ranking algorithm for online 
bipartite matching achieves a competitive ratio of e/(e — 1) ~ 1.582. This was originally shown by 
Karp, Vazirani, and Vazirani [16], though a mistake was recently found in their proof by Krohn 
and Varadarajan and corrected by Goel and Mehta [15] . 

The online bipartite matching problem is merely the first-price version of 2PM, i.e., the problem 
in which there is no requirement for there to exist a second-price bidder to get a profit of 1 for a 
match. The Ranking algorithm chooses a random permutation on the bidders V and uses that to 
choose matches for the keywords U as they arrive. This is described more precisely below. 



Ranking Algorithm: 

Initialization: 

Choose a random permutation (ranking) a of the bidders V. 

Online Matching: 

Upon arrival of keyword u € U: 

Let N(u) be the set of neighbors of u that have not been matched yet. 

If N(u) ^ 0, match u to the bidder v £ N(u) that minimizes cr{v). 

Karp, Vazirani, and Vazirani, and Goel and Mehta prove the following result. 

Theorem 8 (Karp, Vazirani, and Vazirani [TB] and Goel and Mehta [15 J. The Ranking algorithm 
for online bipartite matching achieves a competitive ratio of e/(e — 1) + o(l). 

In order to state our generalization of this result, we define the notion of a left k-copy of a 
bipartite graph G = (U U V, E). Intuitively, a left /c-copy of G makes k copies of each keyword 
u € U such that the neighborhood of a copy of u is the same as the neighborhood of u. More 
precisely, we have the following definition. 

Definition 9. Given a bipartite graph G = {Uq U V,Eq), define a left A;-copy of G to be a graph 
H = (Uh^V, Eh) for which \Ujj\ = k\Uc\ and for which there exists a map £ : Uh —> Uq such that 

• for each uq £ Uq there are exactly k vertices uh £ Uh such that C( u h) = U G> and 
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• for all uh G Uh and v G V, (uh, v) £ Eh if and only if (((uh),v) £ Eg- 

Our generalization of Theorem [8] describes the competitive ratio of Ranking on a graph H that 
is a fc-copy of G. It's proof, presented in Appendix [Bj is a modification to the proof of Theorem [8] 
presented by Birnbaum and Mathieu [3J. 

Theorem 10. Let G = (JJq U V, Eq) be a bipartite graph that has a maximum matching of size 
OPT\p, and let H = (Uh U V,Eh) be a left k-copy of G. Then the expected size of the matching 
returned by Ranking on H is at least 

WPT 1P (l- • 

Proof. See Appendix [Bj □ 



Using this result, we are able to prove that the following algorithm, called RankingSimulate, 
achieves a competitive ratio of 4y / e/(y / e — 1). 



RankingSimulate Algorithm: 


Initialization: 




Set M, the set of matched bidders, to 0. 




Set R, the set of reserved bidders, to 0. 




Choose a random permutation (ranking) a of the bidders V. 




Online Matching: 




Upon arrival of keyword u G U: 




Let N(u) be the set of neighbors of u that are not in M or R. 




If N(u) = 0, do nothing. 




If |JV(u)| = 1, let v be the single bidder in N(u). 




With probability 1/2, match u to v and add v to M, and 




With probability 1/2, add v to R. 




If |iV(u) > 2, let v\ and V2 be the two distinct bidders in N(u) 


that minimize cr(t>). 


With probability 1/2, match u to vi, add v\ to M, and add 


t>2 to R, and 


With probability 1/2, match u to i>2, add v\ to R, and add t>2 to M. 



Let G = (U G U U, Eg) be the bipartite input graph to 2PM, and let H = (U H U V, E H ) be a left 
2-copy of H. In the arrival order for H, the two copies of each keyword ug G U arrive in sequential 
order. 



Lemma 11. Fix a ranking a on V . For each bidder v G V, let X v be the indicator variable for 
the event that v is matched by Ranking on H, when the ranking is Let X' v be the indicator 
variable for the event that v is matched by RankingSimulate on G, when the ranking is a. Then 
E(X' V ) = X v /2. 

Proof. It is easy to establish the invariant that for all v G V, X v = 1 if and only if RankingSimulate 
puts v in either M or R. Furthermore, each bidder v G V is put in M or R at most once by 
RankingSimulate. The lemma follows because each time RankingSimulate adds a bidder v to M 
or R, it matches it with probability 1/2. □ 

4 Note that once a is fixed, X v is deterministic. 



9 



Theorem 12. The competitive ratio of Ranking Simulate is 2y / e/(y / e — 1) ~ 5.083. 



Proof. For a permutation a on V, let RankingSimulate(o") be the matching of G returned by 



RankingSimulate, and let Ranking(cr) be the matching of H returned by Ranking. Lemma 11 



implies that, conditioned on a, E(| RankingSimulate (er)|) = | Ranking(<r)|/2. By Theorem 10 



E(|RankingSimulate(a)|) = - E(| Ranking(cj)|) > OPT 1P (l - l/e 1/2 + o(l) 

Fix a bidder v £ V. Let P v be the profit from v obtained by RankingSimulate. Suppose that 
v is matched by RankingSimulate to keyword u £ Uq. Recall that we have assumed without loss 
of generality that the degree of u is at least 2. Let v' ^ v be another bidder adjacent to u. Then, 
given that v is matched to u, the probability that v' is matched to any keyword is no greater than 
1/2. Therefore, E(P„|w matched) > 1/2. Hence, the expected value of the second-price matching 
returned by RankingSimulate is 



^E(P„) = ^2E(P v \v matched) Fr(v matched) 
> - Fr(v matched) 



2 

= 2^(1 R' an kingSimulate(cr) | ) 

> l -OPT lP (l - 1/e 1 / 2 + o(l) 

> \0PT 2P (l-l/e l ' 2 + o{l) 
where OPT2P is the size of the optimal second-price matching on G. □ 



6 Conclusion 

In this paper, we have shown that the complexity of the Second-Price Ad Auctions problem is quite 
different from that of the more studied First-Price Ad Auctions problem, and that this discrepancy 
extends to the special case of 2PM, whose first-price analogue is bipartite matching. On the positive 
side, we have given a 2-approximation for offline 2PM and a 5.083-competitive algorithm for online 
2PM. 

Some open questions remain. Closing the gap between 2 and 364/363 in the approximability of 
offline 2PM is one clear direction for future research, as is closing the gap between 2 and 5.083 in 
the competitive ratio for online 2PM. Another question we leave open is whether the analysis for 
RankingSimulate is tight, though we expect that it is not. 
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A Discussion of Related Models 



In this section, we prove the statements claimed in the introduction regarding the relationships 
between the optimal solutions of our model and the strict and non-strict models of Goel et al. |14] . 
Given an instance A, let OPT2P be the optimal solution value in our model; let OPT s t T ict be the 
optimal solution value under the strict model; and let OPT non _ s t r i c t be the optimal solution value 
under the non-strict model. We prove the following theorem. 

Theorem 13. For any instance A, OPT non — strict 

< (2+1/ Rmax)OPT stnct < 8(2+1 /R max )OPT 2P . 

The first inequality follows from the work of Goel et al. [13], so we need only prove that 
OPT stnct <80PT 2 p. 

The core of our argument is a reduction from 2PAA to the First-Price Ad Auctions problem 
(lPAA)J^jin which only one bidder is chosen for each keyword and that bidder pays the minimum 
of its bid and its remaining budget. Given an instance A of 2PAA, we construct an instance A' of 
1PAA problem by replacing each bid b u ^ v by 

K,v - ., Kv ■ 

v'f=v : b u y<b u ,v 

Denote by OPT\p{A') the optimal value of the first-price model on A'. The following two lemmas 
prove Theorem [l3] by relating both OPT non ^ str i c t{A) and OPT 2 p(A) to OPTip(A'). 

Lemma 14. OPT non . strict (A) < OPT 1P (A'). 

Proof. For an instance A, we can view a non-strict second-price allocation of A as a pair of (partial) 
functions f\ and f 2 from the keywords U to the bidders V, where f\ maps each keyword to the 
bidder to which it is allocated and f 2 maps each keyword to the bidder acting as its second-price 
bidder. Thus, if f\{u) = v and f 2 {u) = v' then u is allocated to v and the price v pays is b u y. We 
have, for all such u, v, and v' , that b u y < b' uv . 

We construct the first-price allocation on A' defined by fi and claim that the value of this 
first-price allocation is at least the value of the non-strict allocation defined by f\ and f 2 . It suffices 
to show that for any bidder v, the profit that the non-strict allocation gets from v is at most the 
profit that the first-price allocation gets from v, or in other words, 

min I B v , ^ b uj 2 (u) I < min I B v , ^ b' u v 

\ u:fi(u)=v J \ u:f x {u)=v 

This inequality follows trivially from the fact that ^./ 2 ( M ) ^ fx(u) ^ or an a h° ca t e d keywords u, 
and hence the lemma follows. □ 

Lemma 15. OPT 1P (A') < 80PT 2P (A). 

Proof. Given an optimal first-price allocation of A' , we can assume without loss of generality that 
each bidder's budget can only be exhausted by the last keyword allocated to it, or, more formally, 
if u±, u 2 , ■ ■ ■ ttfe are the keywords that are allocated to a bidder v and they come in that order, then 



5 Recall that this problem has also been called the Adwords problem [21] and the Maximum Budgeted Allocation 
problem [IHEI]. 
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we can assume that Yli=i b ' UiV < B v . The reason for this is that if for some j < k, Yli=i b ' UiV < B v 
and Y%=i — B v , then we can ignore the allocation of Uj + i, . . . u k to v without losing any profit. 

With this assumption, we design a randomized algorithm that constructs a second-price allo- 
cation on A whose expected value in our model is at least 1/8 of the first-price allocation's value. 
Viewing the first-price allocation of A' as a (partial) function / from the keywords U to the bidders 
V and denoting by s(u, v) the bidder v' for which b v / u = b' vu , the algorithm is as follows. 

Random Construction: 

Randomly mark each bidder with probability 1/2. 
For each unmarked bidder v: 
Let S v = 0. 

For each keyword u such that fiu) = v. 

If s(u, v) is marked: S v = S v U {u}. 
Assume that S v = {ui,U2, ■ ■ ■ u k }, where u\, u?, . . . come in that order. 

IfEti^<£„: 

Let f\(ui) = v and f2(u{) = s(ui,v) for all i < k. 
Else: 

If Ei=i Kn,v ^ b 'u k ,v : let h( u i) = v and h{ui) = s{ui,v) for alH < k- 1. 
Else: let fi(u k ) = v and f2(u k ) = s(u k ,v). 

We claim that for the f\ and fi defined by this construction, whenever f(ui) is set to v, the 
profit from that allocation is b' u . v . This is not trivial because in our model, if a bidder's remaining 
budget is smaller than its bid for a keyword, it changes its bid for that keyword to its remaining 
budget. However, one can easily verify that in all cases, if we set f\[ui) = v and /2(«i) = s(ui,v), 
the remaining budget of v is at least b' u , v = b UiiS r UuV y Thus, the (modified) bid of fi(ui) for m is 
still at least the original bid of f2(ui) for m- 

We claim that the expected value of the second-price allocation defined by f\ and fa is at least 
1/80PTip(A'). For each bidder v, let X v be the random variable denoting the profit that fx and 
/2 get from v, and let Y v be the profit that / gets from v. We have OPT\p{A') = ^2 V Y V , so it 
suffices to show that E{X V ) > 1/&Y V for all v 6 V. 

Consider any v G V that is unmarked. Let T v = {u : f(u) = v}. If ^2 ue s v b' uv < B v then 
x v = Huas v b 'u,v If E«e5^ b 'u,v > B v then X v > E» e s„ 6 «,f/ 2 - Thus ' m both case ' we have 

is unmarked] > E[ b' u ^ v /2\v is unmarked] = b' u ^/4 = Y v /4 , 

which implies 

E[X V ] > E[X v \v is unmarked]Pr[w is unmarked] = 1/2 • Y v /4 = Y v /8 . 

□ 



B Proof of Theorem 10 



In this appendix, we provide a full proof of Theorem 10 The proof presented here is quite similar 
to the simplified proof of Theorem [8] presented by Birnbaum and Mathieu jl] . For intuition into 
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the proof presented here, the interested reader is referred to that workj^] 

Let G = (Ug U V, Eg) be a bipartite graph and let H = (Uh U V, Eh) be a left /c-copy of G. Let 
£ : Ujj — ► Ug be a map that satisfies the conditions of Definition [9j Let Mq Q Eq be a maximum 
matching of G. 

Let Ranking(/f, 7r, <t) denote the matching constructed on H for arrival order tt, when the 
ranking is a. Consider another process in which the vertices in V arrive in the order given by a and 
are matched to the available vertex u £ Uh that minimizes tt(u). Call the matching constructed 
by this process Ranking' ( H, tt, a). It is not hard to see that these matchings are identical, a fact 
that is proved in [16]. 



Lemma 16 (Karp, Vazirani, and Vazirani (16]). For any permutations tt and a, Ranking(//, tt, a) = 
Ranking' (if, tt, a). 

The following monotonicity lemma shows that removing vertices in H can only decrease the 
size of the matching returned by Ranking. 

Lemma 17. Let tth be an arrival order for the vertices in Uh, o,nd let an be a ranking on the 
vertices in V . Suppose that x is a vertex in Uh U V, and let H' = {Uh 1 ,Vh> , Eh>) = H \ {x}. 
Let tth' and oh 1 be the orderings of Uh 1 and Vh 1 induced by tth and o~h, respectively. Then 
Ranking(ff', tth 1 , o~h') < Ranking (H, tth, &h)- 

Proof. Suppose first that x G Uh- In this case, V = Vh' and o~h = &H'- Let Qt(H) C V 
be the set of vertices matched to vertices in Uh that arrive at or before time t (under arrival 
order tth and ranking an), and let Qt(H') C V be the set of vertices matched to vertices in 
Uh 1 that arrive at or before time t (under arrival order tth' and ranking an)- We prove by 
induction on t that Qt-i(H') C Q t (H), which by substituting t = n is sufficient to prove the 
claim. The statement holds when t = 1, since Qo(H') = 0. Now supposing we have Qt-2(H') C 
Qt-i(H), we prove Qt-i(H') C Q t (H). Suppose that t is at or before the time that x arrives 
in %h- Then clearly Qt-i(H') = Qt-i(H) C Qt{H). Now suppose that t is after the time that 
x arrives in tth- Let u be the vertex that arrives at time t — 1 in tth'- If u is n °t matched by 
R&nkmg(H' , it h',(th), then Q t -i{H') = Q t -2{H') C Q t _i(H) C Q t (H). Now suppose that u 
is matched by Ranking(/f', tth 1 , o~h), say to vertex v' . We show that v' € Qt(H), which by the 
induction hypothesis, is enough to prove that Qt-i(H') C Q t (H). Note that it arrives at time 
t in tth- Let w be the vertex to which u is matched by Ranking ( H, tth, o~h)- If f = v', we are 
done, so suppose that v ^ v'. Since v Qt-i(H), it follows by the induction hypothesis that 
v Qt—2(H'). Therefore, vertex v is available to be matched to u when it arrives in tth 1 - Since 
Ranking ( H' , tth'i °~h) matched u to v' instead, v' must have a lower rank than v in an- Since 
Ranking ( H, tth, o~h) chose v, vertex v' must have already been matched when vertex u arrived at 
time t in tth, or, in other words, v' £ Qt-i(H) C Q t (H). 

Now suppose that x £ V. In this case, Uh = Uh' and tth = tth'- Let Rt(H) C Uh be the set 
of vertices matched to vertices in V that are ranked less than or equal to t (under arrival order 
tth and ranking o~h), and let Rt(H') C Uh be the set of vertices matched to vertices in V that 



6 For those familiar with the proof in [1], the main difference between the proof of Theorem 10 presented here 
and the proof of Theorem [8] presented in [4] appears in Lemma 21 Instead of letting u be the single vertex that is 



matched to v by the perfect matching, as is done in [4], we choose u uniformly at random from one of the k vertices 
that correspond to the vertex that is matched to v by the perfect matching. The rest of the proof is essentially the 
same, but we present its entirety here for completeness. 
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are ranked less than or equal to t (under arrival order tth and ranking ajj')- Then by Lemma 16 
we can apply the same argument as before to show that Rt-i(H') C Rt(H) for all t, which by 
substituting t = n, is sufficient to prove the claim. □ 

We define the following notation. For all uq £ Uq, let £ (it<g) be the set of all uh £ Uh such 
that C( u h) = uq, and for any subset U' G C f/g, let £ _1 (?7 G ) be the set of all uh £ Uh such that 
((uh) £ f^G- ^he following lemma shows that we can assume without loss of generality that Mq 
is a perfect matching. 

Lemma 18. Let U' C C/g V Q V be the subset of vertices that are in Mq. Let G' be the 
subgraph of G induced by U' U V , and let H' be the subgraph of H induced by C -1 ^') U V . Then 
the expected size of the matching produced by Ranking on H' is no greater than the expected size of 
the matching produced by Ranking on H . 



Proof. The proof follows by repeated application of Lemma 17 for all x that are not in C 1 ^') U 



V. □ 



In light of Lemma [T8l to prove Theorem 10 it is sufficient to show that the expected size of 



the matching produced by Ranking on H' is at least (1 - l/e l / k - o(l))\M G \. To simplify notation, 
we instead assume without loss of generality that G = G', and hence G has a perfect matching. 
Let n = OPT\p = \Mq\ = \V\. Henceforth, fix an arrival order n. To simplify notation, we write 
Ranking(cr) to mean Ranking (H, ir, a). 

Let / : Uh — * V be a map such that for all v £ V, there are exactly k vertices u £ Uh such 
that f(u) = v. The existence of such a map / follows from the assumption that G has a perfect 
matching. For any vertex v £ V let f~ l (v ) be the set of u £ Uh such that f(u) = v. We proceed 
with the following two lemmas. 

Lemma 19. Let u £ Uh, and let v = f{u). For any ranking a, if v is not matched by Ranking(cr), 
then u is matched to a vertex whose rank is less than the rank of v in a. 

Proof. If v is not matched by Ranking(cr), then since there is an edge between u and v, it was 
available to be matched to u when it arrived. Therefore, by the behavior of Ranking, u must have 
been matched to a vertex of lower rank. □ 

Lemma 20. Let u £ Uh, and let v = f{u). Fix an integer t such that 1 < t < n. Let a be a 
permutation, and let a' be the permutation obtained from a by removing vertex v and putting it 
back in so its rank is t. If v is not matched by Ranking(cr'), then u must be matched by Ranking(o") 
to a vertex whose rank in a is less than or equal to t. 



Proof. For the proof, it is convenient to invoke Lemma 16 and consider Ranking'(cr) and Ranking'^') 
instead of Ranking(cr) and Ranking(o"'). In the process by which Ranking' constructs its matching, 
call the moment that the i th vertex in V arrives time t. For any 1 < s < n, let R s (cr) (resp., R s (a')) 



be the set of vertices in Uh matched by time s in a (resp., a'). By Lemma 19 if v is not matched 
by Ranking(<T / ), then u must be matched to a vertex v' in Ranking^') such that cr'{v') < o~'(v). 
Hence u £ Rt~\{a'). We prove the lemma by showing that R t -\{a') C Rt(a). Let t be the time 
that v arrives in a. Then if t > t, the two orders a and a are identical through time t, which implies 
that Rt-x{a') = Rt-i(a) Q R t (a). 

Now, in the case that t < t, we prove that for 1 < s < t, R s -i(<j') Q R s (a). The proof, which is 



similar to the proof of Lemma 17 proceeds by induction on s. When s = 0, the claim clearly holds, 
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since Ro(a') = 0- Now, supposing that R s -2{o~') C R s _i(a), we prove that C R s (a). If 

s <t, then the two orders <r and cr' are identical through time s, so R s -i(a') = R s -i(a) C R s (a). 
Now suppose that s > t. Then the vertex that arrives at time s — 1 in a' is the same as the 
vertex that arrives at time s in a. Call this vertex w. If w is not matched by Ranking' (cr'), then 
R s -i(a') = R s -2(&'), and we are done by the induction hypothesis. Now suppose that w is matched 
to vertex x' by Ranking' (o~') and to vertex x by Ranking'(cr). If x = x', then again we are done 
by the induction hypothesis, so suppose that i/i'. Since x was available at time s — 1 in a, we 
have x R s -i(a), and by the induction hypothesis x R s -2(cr'). Hence, x was available at time 
s — 1 in a'. Since Ranking'(o~') matched w to x' , it must be that ir{x') < tt{x). This implies that 
x' must be matched when w arrives at time s in o~, or in other words, x' G R s -\(a) C R s (a). By 
the induction hypothesis, we are done. □ 

Lemma 21. For 1 < t < n, Zei denote the probability over a that the vertex ranked t in V is 
matched by Ranking(cr) . Then 

1 * 

1 - X t < ^— }Xs ■ (1) 

kn 

8=1 

Proof. Let a be permutation chosen uniformly at random, and let a' be a permutation obtained 
from cr by choosing a vertex v £ V uniformly at random, taking it out of a, and putting it back 
so that its rank is t. Note that both a and a' are distributed uniformly at random among all 
permutations. Let u be a vertex chosen uniformly at random from Note that conditioned 

on a, u is equally likely to be any of the kn vertices in Ujj- Let Rt be the set of vertices in Uh 
that are matched by Ranking(a) to a vertex of rank t or lower in a. Lemma [20] states that if v 
is not matched by Ranking(cr'), then u S Rt- The expected size of Rt is X^i<s<i 2; s- Hence, the 
probability that u £ Rt, conditioned on a, is (l/(kn)) J2i< s <t x s- The lemma follows because the 
probability that v is not matched by Ranking 



We are now ready to prove Theorem 10 



Proof of Theorem 10 For < t < n, let St = Yli< s <t Xs - Then the expected size of the matching 
returned by Ranking on H is S n . Rearranging ([TJ yields, for 1 < t < n, 

s 'K^tt) (1 + s '- i) ' 

which by induction implies that St > J2i< s <t(^ n /(^ n + 1))*' anc ^ hence 

Sn >- t (j^)' - ^ (l - (l - ^LY") = *» (x - ^ + 0(1 , 



□ 



C Other Missing Proofs 

In this appendix, we provide the missing proofs from the body of the paper. 
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C.l Proof of Theorem [Q 

Fix a constant d > c, and let no be the smallest integer such that for all n > no, 

c' • V 9 n ; > c n 3 + cn 2 + n + 2 2 

cn 2 + n + 2 K ; v ' 

and 

n/2 + 1 , , 

(3) 

Note that since no depends only on c', it is a constant. 

We reduce from PARTITION, in which the input is a set of n > no items, and the weight of 
the i-th item is given by w,. If W = Y27=l w ii then the question is whether there is a partition of 
the items into two subsets of size n/2 such that the sum of the w^s in each subset is W/2. It is 
known that this problem (even when the subsets must both have size n/2) is NP-hard |13j . 

Suppose that there is an m/c'-approximation algorithm to 2PAA(c); we will show that con- 
structing the following instance of 2PAA(c) (illustrated in Figure |4| allows us to use the m/c'- 
approximation to solve the PARTITION instance: 

• First, create n + 2 keywords c±, . . . , c n , ei, ei- Second, create an additional set 

G = {§i,k '■ 1 < i < n 2 and 1 < k < c} 
of cn 2 keywords. The keywords arrive in the order 

ci, . . . , Cn, ei, e 2 ,5i,i' • • • i9l,c ,9n 2 ,ii ■ ■ ■ >9n 2 ,c ■ 

• Create n 2 + 4 bidders a, d±, d,2, f, h\, . . . , h n 2. Set the budgets of a, d%, and d,2 to cW(l + n/2). 
Set the budget of / to cW(n 3 + 1). For 1 < i < n 2 , set the budget of hi to cWn 3 . 

• For 1 < i < n, bidders a, d\, and bid c(wi + W) on keyword q. 

• For j G {1,2}, bidder dj bids cW on keyword ej. Bidder / bids cW/2 on both e\ and e2- 

• For 1 < i < n 2 and 1 < k < c, keyword receives a bid of W(n s + 1) from bidder / and a 
bid of Wn 3 from bidder hi. 

This reduction can clearly be performed in polynomial time. Furthermore, it can easily be checked 
that ([3| implies that no bidder bids more than 1 /c of its budget on any keyword. 

We first show that if the PARTITION instance is a "yes" instance, then there exists a feasible 
solution to the 2PAA(c) instance of value at least cW(n 5 + n + 2). Let U C [n] be such that 

\U\ = n/2 and EieU™i = Z ie u w i = W / 2 - 

We construct a solution to the 2PAA(c) instance as follows. For every i S U, allocate Cj to d\, 
and for every i G U, allocate q to c?2- For each of these allocations, choose a as the second-price 
bidder. This will reduce the budget of d\ and c?2 to exactly cW/2, and hence the bids from d\ 
to e\ and from c?2 to e2 will be both be reduced to cW/2. Allocate e\ to / choosing d\ as the 
second-price bidder, and allocate e2 to / choosing d<i as the second-price bidder. This will reduce 
the budget of / to cWn 3 . The profit from the solution constructed so far is cW{n+2). Now allocate 
51,1,51,2, • • • , 5i c-l to /, choosing h\ as the second-price bidder. This will reduce the budget of / 
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Sn2,c 



Figure 4: The 2PAA(c) instance of the reduction. Each bidder's budget is shown above its node, and the 
bids of bidders for keywords is shown near the corresponding edge. 

to Wn 3 . Hence, it can act as the second-price bidder for each of the remaining keywords in G. 
Allocate g\ fi to h\, choosing / as the second-price bidder, and then, for 2 < i < n 2 and 1 < k < c, 
allocate g^k to hi, choosing / as the second-price bidder. The profit obtained for each keyword 
in G in this assignment is Wn 3 . Since |G| = cn 2 , the total profit of the solution constructed is 
cW(n + 2) + cWn 5 = cW(n 5 + n + 2). 

We now show that if there is a second-price matching in the 2PAA(c) instance of value at least 
cW(n 3 + cn 2 + n + 2), then there must be a partition of w±, . . . , w n . In such a matching, at most 
cW{n + 2) units of profit can be obtained from keywords c±, . . . ,c n ,e\, e2, since the initial second- 
highest bids on those keywords sum to cW(n + 2). Hence, at least cW(n 3 + cn 2 ) profit must come 
from the keywords in G. 

Suppose that the budget of / is greater than cWn 3 after keywords e\ and are allocated. Note 
that at least c of the keywords in G must be allocated to reach a profit of cW(n 3 + cn 2 ) on these 
keywords. Consider what happens after the first c of the keywords in G are assigned. For each 
of these keywords, / must have been the first-price bidder, so its budget is reduced to an amount 
greater than and less than or equal to cW. Hence, for each keyword in G allocated henceforth, 
/ is the second-price bidder, and the profit is at most cW. Since there are at most c(n 2 — 1) 
more keywords in G, the total profit from the keywords in G is at most cWn 3 + c 2 W(n 2 — 1), 
which contradicts the fact that at least cW(n 3 + cn 2 ) units of profit must come from G. Hence, we 
conclude that the budget of / is less than or equal to cWn 3 after keywords e\ and ei are allocated. 

The budget of / can only be smaller than cWn 3 if / acts as the first-price bidder for both e\ 
and e2- But this can happen only if the budgets of both d\ and di are reduced to an amount less 
than or equal to cW/2. For j G {1,2}, let Uj C [n] be the set of indices i such that dj acts as the 
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first-price bidder for i. For both j, we have that 

E,„. cW cWn 
c( Wi + W)>— + —. (4) 

Rearranging Q yields ^ ieU . W > W/2+Wn/2-Y^ eUj m, which implies W\Uj\ > W/2 + Wn/2- 
W, and hence \Uj\ > n/2 — 1/2. By integrality, then, \Uj\ > n/2 for both j. Hence \Uj\ = n/2 
for both j, and using Q again, we have J2ieUj CWi + c W\Uj\ > cW/2 + cWn/2 which implies that 
^2i£U w i > W/2 for both j. Therefore, the partition defined by U\ and U 2 is a solution to the 
PARTITION instance. 

To conclude the proof, note that the number of keywords in the 2PAA(c) instance is cn 2 + 
n + 2. Hence, if the PARTITION instance is a "yes" instance, then by (|2]), we can run the m/c'- 
approximation algorithm to find a second-price matching of value at least cW(n s + cn 2 + n + 2), 
and if the PARTITION instance is a "no" instance, then the value of the solution returned by the 
algorithm must be strictly less than cW(n 3 -\-cn 2 + n + 2). Hence, an m/c'-approximation algorithm 
for 2PAA(c) can be used to solve PARTITION. 



C . 2 Proof of Theorem g] 

The proof is by a reduction from vertex cover. Given a graph G, we construct an instance f{G) 
of 2PM as follows. First, for each edge e € E(G), we create a keyword with the same label (called 
an edge keyword), and for each vertex v E V(G), we create a bidder with the same label (called 
a vertex bidder). Bidder v bids for keyword e if vertex v is one of the two end points of edge e. 
(Recall that in 2PM, if a bidder makes a non-zero bid for a keyword, that bid is 1.) In addition, 
for each edge e, we create a unique bidder x e who also bids for e. Furthermore, for each vertex v, 
we create a gadget containing two keywords h v and l v and two bidders y v and z v . We let v and y v 
bid for h v ; and y v and z v bid for l v . The keywords arrive in an order such that for each v £ V(G), 
keyword h v comes before l v , and the edge keywords arrive after all of the h^s and l^s have arrived. 
An example of this reduction is shown in Figure [5] 

The following lemma provides the basis of the proof. 

Lemma 22. Let OPTyc an d OPT2P be the size of the minimum vertex cover of G and the 
maximum second-price matching on f{G), respectively. Then 

OPT 2P = 2\V(G)\ + \E{G)\ - OPT vc 

Proof. We first show that given a vertex cover S of size OPTyc of G, we can construct a solution 
to the 2PM instance whose value is 2|1^(G)| + |i?(G)| — OPTyc- For each vertex v £ S, we allocate 
h v to v (with y v acting as the second-price bidder) and to y v (with z v acting as the second-price 
bidder), getting a profit of 2 from the gadget for v. For each vertex v £ S, we allocate h v to y v (with 
v acting as the second-price bidder) and ignore l v , getting a profit of 1 from the gadget for v. We 
then allocate each edge keyword e to x e . During each of these edge keyword allocations, at least one 
of the two vertex bidders that bid for e is still available, since S is a vertex cover. Hence, for each 
of these allocations, there is a bidder that can act as a second-price bidder, and the profit from the 
allocation is 1. This allocation yields a second-price matching of size 2| 1/(G) | + |-E(G)| — OPTyc- 
Therefore, OPT 2P > 2\V{G)\ + \E(G)\ - OPT vc . 
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To show that OPT2P < 2|V(G)| + |i?(G)| — OPTyc, we start with an optimal solution to 
/(G) of value OPT 2P and construct a vertex cover of G of size 2\V(G)\ + \E{G)\ - OPT 2P . To 
do this, we first claim that there exists an optimal solution of /(G) in which every edge-keyword 
is allocated for a profit of 1. Consider any optimal second-price matching of the instance. Let e 
be an edge-keyword e that is not allocated for a profit of 1. If it is adjacent to a vertex bidder 
that is unassigned when e arrives, then e can be allocated to x e for a profit of 1, which can only 
increase the value of the solution. Suppose, on the other hand, that both of its vertex bidders are 
not available when e arrives. Let v be a vertex bidder that bids for e. Since it is not available, 
h v must have been allocated to v. We can transform this second-price matching to another one 
in which h v is assigned to y v , l v is ignored and e is assigned to x e , with v acting the second-price 
bidder in both cases. This does not decrease the total profit of the solution. Hence, we can perform 
these transformations for each edge keyword e that is not allocated for a profit of 1 until we obtain 
a new optimal solution in which each edge keyword is allocated for a profit of 1. 

Now consider an optimal second-price matching in which all edge keywords are allocated for a 
profit of 1. Let T C V{G) be the set of vertices represented by vertex bidders that are not allocated 
any keywords in this second- price matching. Then \T\ = 2|V(G)| + |i£(G)| — \OPT 2 p\, and T is a 
vertex cover, which implies OPTyc < 2|V(G)| + |i£(G)| — OPT 2 p. The lemma follows. □ 

Now, suppose that we have an a- approximation for 2PM. We will show how to use this approx- 
imation algorithm and our reduction to obtain an ((8a — 7)/a)-approximation for Vertex Cover on 
4-regular graphs. By Theorem [3j this means that (8a — 7) /a > 53/52, and hence a > 364/363, 
unless P = NP. 

To construct this ((8a — 7)/a)-approximation algorithm, given a 4-regular graph G, run the 
above reduction to obtain a 2PM instance /(G). Then use the a-approximation to obtain a second- 
price matching M whose value is at least OPT 2 p/a. Now, just as in the proof of Lemma 22 we can 



assume that in M, every edge keyword e is allocated to x e . Hence, the set of vertices T associated 
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with the vertex bidders that are not allocated a keyword form a vertex cover, and 



T\ < 2\V{G)\ + \E(G)\-OPT 2P /a 

= 2\V{G)\ + \E{G)\ - (2\V{G)\ + \E{G)\ - OPT vc )/a 
= (1 - l/a)(2\V(G)\ + \E(G)\) + OPT vc /a 



(5) 



Since G is 4-regular, we have OPT vc > m/4 = (2\V(G)\ + \E(G)\)/8, and hence by @, we conclude 
that \T\ < ((8a — 7) /a)OPTyc, which finishes the proof of the theorem. 

C.3 Proof of Theorem H 

Since the number of vertices matched by / is an upper bound on the profit of the maximum second- 
price matching on G, we need only to prove that the second-price matching contains at least half 
of the keywords matched by /. By the behavior of the algorithm, it is clear that whenever a vertex 
u is matched to f{u) in the second-price matching, the profit obtained is 1. Furthermore, every 
time an an edge is removed from /, a new keyword is added to the second-price matching. Thus, 
the theorem follows. 

C.4 Proof of Theorem [7] 

We invoke Yao's Principle [27] and construct a distribution of inputs for which the best determin- 
istic algorithm achieves an expected performance of (asymptotically) 1/2 the value of the optimal 
solution. 

Our distribution is constructed as follows. The first keyword arrives, and it is adjacent to two 
bidders. Then the second keyword arrives, and it is adjacent to one of the two bidders adjacent 
to the first keyword, chosen uniformly at random, as well as a new bidder; then the third keyword 
arrives, and it is adjacent to one of the bidders adjacent to the second keyword, chosen uniformly at 
random, as well as a new bidder; and so on, until the m-th keyword arrives. We call this a normal 
instance. To analyze the performance of the online algorithms, we also define a restricted instance 
to be one that is exactly the same as a normal instance except that one of the two bidders of the 
first keyword is marked unavailable, i.e., he can not participate in any auction. 

Clearly, an offline algorithm that knows the random choices beforehand can allocate each key- 
word to the bidder that will not be adjacent to the keyword that arrives next. In this way, it can 
ensure that for each keyword, there is a bidder that can act as a second-price bidder. Hence for a 
normal instance, the optimal second-price matching obtains a profit of to. 

Consider the algorithm Greedy, which allocates a keyword to an arbitrary adjacent bidder if 
and only if there is another available bidder to act as a second-price bidder. Our proof consists 
of two steps: first, we will show that the expected performance of Greedy on the normal instance 
is (to + l)/2, and second we will prove that Greedy is the best algorithm in expectation for both 
types of instances. 

Let X k and Y k * be the expected profit of Greedy on a normal and a restricted instance of k 
keywords, respectively (where Xq and Yq are both defined to be 0). Given the first keyword of a 
normal instance, Greedy allocates it to an arbitrary bidder. Then, with probability 1/2, it is faced 
with a normal instance of k — 1 keywords, and with probability 1/2, it is faced with a restricted 
instance of k — 1 keywords. Therefore, for all integers k > 1, 



X* k = l/2(X* k _ 1 + Y k *_ 1 ) + l 



(6) 
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On the other hand, given the first keyword of a restricted instance, Greedy just waits for the 
second keyword. Then, with probability 1/2, the second keyword chooses the marked bidder, giving 
Greedy a restricted instance of k— 1 keywords, and with probability 1/2, the second keyword chooses 
the unmarked bidder, giving Greedy a normal instance of k — 1 keywords. Therefore, for all k, 

n* = V2(*iU+ifc-i) • (7) 

From @ and Q we have, for all k, 

Y k * = X* k - 1 . (8) 

Plugging Q for k = to — 1 into ([6]) for k = m yields 

X* m = X* m _, + 1/2 , (9) 

and hence, by induction X* m = (to + l)/2. 

Now, we prove that Greedy is the best among all algorithms on these two types of instances. 
In fact, we make it easier for the algorithms by telling them beforehand how many keywords in 
the instance they will need to solve. Let X m and Y m be the expected number of keywords in the 
second-price matching produced by the best algorithms that "know" that they are solving a normal 
instance of size m and a restricted instance of size m, respectively. Let A m and B m denote these 
optimal algorithms. 

We prove that X m < X^ and Y m < Y^ for all to by induction. The base case in which to = 1 is 
easy, since no algorithm can obtain a profit of more than one on a normal instance of one keyword 
or more than zero on a restricted instance of one keyword. We now prove the induction step. 

First, consider A m - When the first keyword arrives, A m has two choices: either ignore it or 
allocate it to one of the bidders. If A m ignores the first keyword, its performance is at most the 
performance of A m -i on the remaining keywords, which constitute a normal instance of to — 1 
keywords. On the other hand, if A m allocates the first keyword to one of the bidders, then with 
probability 1/2, it is faced with a normal instance of to — 1 keywords, and with probability 1/2 it 
is faced with a restricted instance of to — 1 keywords. The performance of A m on these instance is 
at most the performance of A m -i and B m -i, respectively. Thus, by the induction hypothesis, Q, 
and @, we have 

X m < max{X m _i, l/2(X m _i + Y m -\) + 1} 
< max{X*_ 1 ,l/2(X*_ 1 + y^_ 1 ) + l} 
= max^.l^+I^-lj + l} 
= X* m _ 1 + l/2 

V"* 

Next, consider B m . When the first keyword arrives, B m cannot allocate it for a profit. If it 
allocates it for a profit of 0, then it is faced with a restricted instance of to — 1 keywords. If it does 
not allocate the keyword, then with probability 1/2, B m is faced with a normal instance of to — 1 
keywords, and with probability 1/2, it is faced with a restricted instance of to — 1 keywords. Its 
performance on these instances is at most those of A m -i and B m -i, respectively. Thus, by the 
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induction hypothesis and Q, we have 



Y 

1 m 



This completes our proof. 



< max{ Y m _ i,l/2 (X m _ i + Y m - 

< max{y*_ 1 ,l/2K_ 1 + lO- 

- 1 m ' 
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